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Abstract 

We define a new cost model for the call-by- value lambda-calculus satisfying the invariance 
thesis. That is, under the proposed cost model, Turing machines and the call-by-value lambda- 
calculus can simulate each other within a polynomial time overhead. The model only relies on 
combinatorial properties of usual beta-reduction, without any reference to a specific machine 
or evaluator. In particular, the cost of a single beta reduction is proportional to the difference 
between the size of the redex and the size of the reduct. In this way, the total cost of 
normalizing a lambda term will take into account the size of all intermediate results (as well 
as the number of steps to normal form). 

1 Introduction 

Any computer science student knows that all computational models are extensionally equivalent, 
each of them characterizing the same class of computable functions. However, the definition of 
complexity classes by means of computational models must take into account several differences 
between these models, in order to rule out unrealistic assumptions about the cost of respective 
computation steps. It is then usual to consider only reasonable models, in such a way that the 
definition of complexity classes remain invariant when given with reference to any such reasonable 
model. If polynomial time is the main concern, this reasonableness requirement take the form of 
the invariance thesis [12]: 

Reasonable machines can simulate each other within a polynomially-bounded overhead 
in time and a constant-factor overhead in space. 

Once we agree that Turing machines are reasonable, then many other machines satisfy the invari- 
ance thesis. Preliminary to the proof of polynomiality of the simulation on a given machine, is 
the definition of a cost model, stipulating when and how much one should account for time and/or 
space during the computation. For some machines (e.g., Turing machines) this cost model is obvi- 
ous; for others it is much less so. An example of the latter kind is the type-free lambda-calculus, 
where there is not a clear notion of constant time computational step, and it is even less clear how 
one should count for consumed space. 

The idea of counting the number of beta- reductions [5] is just too naive, because beta-reduction 
is inherently too complex to be considered as an atomic operation, at least if we stick to explicit 
representations of lambda terms. Indeed, in a beta step 

(Xx.M)N -> M{x/N}, 

there can be as many as \M\ occurrences of x inside M. As a consequence, M{x/N} can be as big 
as |M||A|. As an example, consider the term n 2, where n = Xx.Xy.x n y is the Church numeral for 
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n. Under innermost reduction this term reduces to normal form in 3n — 1 beta steps, but there 
is an exponential gap between this quantity and the time needed to write the normal form, that 
is 2^_. Under outermost reduction, however, the normal form is reached in an exponential number 
of beta steps. This simple example shows that taking the number of beta steps to normal form as 
the cost of normalization is at least problematic. Which strategy should we choose 1 ? How do we 
account for the size of intermediate (and final) results? 

Clearly, a viable option consists in defining the cost of reduction as the time needed to normalize 
a term by another reasonable abstract machine, e.g. a Turing machine. However, in this way we 
cannot compute the cost of reduction from the structure of the term, and, as a result, it is very 
difficult to compute the cost of normalization for particular terms or for classes of terms. Another 
invariant cost model is given by the actual cost of outermost (normal order) evaluation, naively 
implemented [8]. Despite its invariance, it is a too generous cost model (and in its essence not 
much different from the one that counts the numbers of steps needed to normalize a term on a 
Turing machine). What is needed is a machine-independent, parsimonious, and invariant cost 
model. Despite some attempts [6, 8, 9] (which we will discuss shortly), a cost model of this kind 
has not appeared yet. 

To simplify things, we attack in this paper the problem for the call-by-value lambda-calculus, 
where we do not reduce under an abstraction and we always fully evaluate an argument before firing 
a beta redex. Although simple, it is a calculus of paramount importance, since it is the reduction 
model of any call-by-value functional programming language. For this calculus we define a new, 
machine-independent cost model and we prove that it satisfies the invariance thesis for time. The 
proposed cost model only relies on combinatorial properties of usual beta-reduction, without any 
reference to a specific machine or evaluator. The basic idea is to let the cost of performing a 
beta-reduction step depend on the size of the involved terms. In particular, the cost of M — > N 
will be related to the difference \N\ — \M\. In this way, the total cost of normalizing a lambda 
term will take into account the size of all intermediate results (as well as the number of steps to 
normal form) . The last section of the paper will apply this cost model to the combinatory algebra 
of closed lambda-terms, to establish some results needed in [3] . We remark that in this algebra 
the universal function (which maps two terms M and N to the normal form of MN) adds only 
a constant overhead to the time needed to normalize MN. This result, which is almost obvious 
when viewed from the perspective of lambda-calculus, is something that cannot be obtained in 
the realm of Turing machines. 

1.1 Previous Work 

The two main attempts to define a parsimonious cost model share the reference to optimal lambda 
reduction a la Levy [10] , a parallel strategy minimizing the number of (parallel) beta steps (see [2] ) . 

Frandsen and Sturtivant [6] propose a cost model essentially based on the number of parallel 
beta steps to normal form. Their aim is to propose a measure of efficiency for functional pro- 
gramming language implementations. They show how to simulate Turing machines in the lambda 
calculus with a polynomial overhead. However, the paper does not present any evidence on the 
existence of a polynomial simulation in the other direction. As a consequence, it is not known 
whether their proposal is invariant. 

More interesting contributions come from the literature of the nineties on optimal lambda 
reduction. Lamping [7] was the first to operationally present this strategy as a graph rewriting 
procedure. The interest of this technique for our problem stems from the fact that a single beta 
step is decomposed into several elementary steps, allowing for the duplication of the argument, the 
computation of the levels of nesting inside abstractions, and additional bookkeeping work. Since 
any such elementary step is realizable on a conventional machine in constant time, Lamping's 
algorithm provides a theoretical basis for the study of complexity of a single beta step. Lawall and 
Mairson [8] give results on the efficiency of optimal reduction algorithms, highlighting the so-called 

1 Observe that we cannot take the lenght of the longest reduction sequence, both because in several cases this 
would involve too much useless work, and because for some normalizing term there is not a longest reduction 
sequence. 
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bookkeeping to be the bottleneck from the point ot view of complexity. A consequence of Lawall 
and Mairson's work is evidence on the inadequacy of the cost models proposed by Frandscn and 
Sturtivant and by Asperti [1], at least from the point of view of the invariance thesis. In subsequent 
work [9] , Lawall and Mairson proposed a cost model for the lambda calculus based on Levy's labels. 
They further proved that Lamping's abstract algorithm satisfies the proposed cost model. This, 
however, does not imply by itself the existence of an algorithm normalizing any lambda term with 
a polynomial overhead (on the proposed cost). Moreover, studying the dynamic behaviour of Levy 
labels is clearly more difficult than dealing directly with the number of beta-reduction steps. 

2 Syntax 

The language we study is the pure untyped lambda calculus endowed with lazy evaluation (that 
is, we never reduce under an abstraction) and call-by-value reduction. 

Definition 1 The following definitions are standard: 

• Terms are defined as follows: 

M ::= x | Xx.M \ MM 

A denotes the set of all lambda terms. 

• Values are defined as follows: 

V::=x\ Xx.M 

S denotes the set of all closed values. 

• Call-by-value reduction is denoted by — ► and is obtained by closing the rule 

(Xx.M)V -» M{V/x} 

under all applicative contexts. Here M ranges over terms, while V ranges over values. 

• The length \M\ of M is the number of symbols in M. 

Following [11] we consider this system as a complete calculus and not as a mere strategy for 
the usual lambda-calculus. Indeed, respective sets of normal forms are different. Moreover, the 
relation — > is not deterministic although, as we are going to see, this non-determinism is completely 
harmless. 

The way we have defined beta-reduction implies a strong correspondence between values and 
closed normal forms: 

Lemma 1 Every value is a normal form and every closed normal form is a value. 

Proof. By definition, every value is a normal form, because evaluation is lazy and, as a conse- 
quence, every abstraction is a normal form. For the other direction, we have to prove that if M 
is a closed normal form, then M is an abstraction. We proceed by induction on M. But if M is 
an application NL, then by induction hypothesis both N and L are abstractions and M is not a 
normal form. □ 

The prohibition to reduce under abstraction enforces a strong notion of confluence, the so-called 
one-step diamond property, which instead fails in the usual lambda calculus. 

Proposition 1 (Diamond Property) If M — > N and M — > L then either N = L or there is P 
such that N — > P and L — > P. 

Proof. By induction on the structure of M. Clearly, M cannot be a variable nor an abstraction 
so M = QR. We can distinguish five cases: 

• If Q = Xx.T and R is a value, then N = L = T{x/R}, because R is a variable or an abstraction. 

• If N = TR and L = UR, where Q — > T and Q — > U, then we can apply the induction 
hypothesis. 
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• Similarly, if R — ► T and R — > {/, where AT = QT and L = QC/, then we can apply the induction 
hypothesis. 

• If N = QT and L = UR, where R -> T and Q -> U, then N -> C/T and L -» f7T. 

• Similarly ]£N = UR and L ee QT, where i? -> T and Q -> U, then AT — ► J7T and L — ► [7T. 
This concludes the proof. □ 

As an easy corollary of Proposition 1 we get an equivalence between all normalization strategies — 
once again a property which does not hold in the ordinary lambda-calculus. 

Corollary 1 (Strategy Equivalence) M has a normal form iff M is strongly normalizing. 

Proof. Observe that, by Proposition 1, if M is diverging and M — > N, then A" is diverging, too. 
Indeed, if M = M — ► Mi — > M 2 — ► . . ., then we can build a sequence N = N — ► Ai — > A^ 2 — ► . . . 
in a coinductive way: 

• If Mi = AT, then we define Ni to be Mj+i for every i > 1. 

• If Mi ^ AT then by proposition 1 there is A^i such that Mi -> ATi and N —> Ni. 

Now, we prove by induction on n that if M — N, with A^ normal form, then M is strongly 
normalizing. If n = 0, then M is a normal form, then strongly normalizing. If n > 1, assume, 
by way of contraddiction, that M is not strongly normalizing. Let L be a term such that M — > 
L N. By the above observation, L cannot be strongly normalizing, but this goes against 

the induction hypothesis. This concludes the proof. □ 

But we can go even further: in this setting, the number of beta-steps to the normal form is 
invariant from the evaluation strategy: 

Lemma 2 (Parametrical Diamond Property) If M — > n A^ and M L then there is a 
term P such that N -^> l P and L P where I < m, k < n and n + I = m + k. 

Proof. We will proceed by induction on n + m. If n + m = 0, then P will be N = L = M. If 
n + m > but either n = or m = 0, the thesis easily follows. So, we can assume both n > and 
m > 0. Let now Q and R such that M — > Q -^ n ^ 1 N and M —> R -^ m - 1 L. From proposition 1, 
we can distinguish two cases: 

• Q = R, By induction hypothesis, we know there is T such that N T and L — > fe T, where 
I < m — 1 < to, and k < n — 1 < n. Moreover (n — 1) + I = (to — 1) + k, which yields 
n + I = to + k. 

• There is T with Q — > T and i? — > T. By induction hypothesis, there are two terms £/, T4^ 
such that T ->* U,T W, N U, L where i < n - 1, j < to - 1, p, g < 1, 
n — l+ p= 1 + i and m— l + q= l+ j. By induction hypothesis, there is P such that {/ P 
and W ^ s P, where r < j, s < i and r + i = s + j. But, summing up, this implies 



A" 


. r p+r 


P 


L 


y q+s 


P 


p + r 


< 


l+j<l+TO — 1 = TO 


q + s 


< 


l+z<l+n-l=n 


p + r + n 




(n-l+p) + l + r=l + i + l + r 






2 + r + i = 2 + s + j = 1 + 1 + j + s 






1+to— l+q+s=q+s+n 



This concludes the proof. □ 

Proposition 2 For every term M, there are at most one normal form N and one integer n such 
that M — >™ N. 

Proof. Suppose M — >™ A^ and M L, with AT and L normal forms. Then, by lemma 2, there 
are P, k, I such that A^ P and L P and n + I = m + k. But since N and L are normal 
forms, P = N , P = L and / = k = 0, which yields N = L and n = m. □ 
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3 An Abstract Time Measure 

We can now define an abstract time measure and prove a diamond property for it. Intuitively, 
every beta-step will be endowed with a positive integer cost bounding the difference (in size) 
between the reduct and the redex. 

Definition 2 • Concatenation of a, (3 € N* is simply denoted as a[3. 

a 

• -» will denote a subset o/AxN* x A. In the following, we will write M -» N standing for 
(M,a,N) E^>. The definition of -» (in SOS-style) is the following: 

M^N n = max{l, |JV| - \M\} M -» N N^>L 



e (n) ap 

M -» M M -» N M ^> L 

Observe we charge max{l, \N\ — \M\} for every step M — + N . In this way, the cost of a 
beta-step will always be positive. 

• Given a — (m, . . . , n m ) 6 N* 7 define \ \a\ \ — Y^T=i n »- 

The confluence properties we proved in the previous section can be lifted to this new notion on 
weighted reduction. 

(n) (m) 

Proposition 3 (Diamond Property Revisited) If M -» N and M -» L, then either N = L 

(to) (n) 

or there is P such that N -» P and L -» P. 

a a 

Proof. We can proceed as in Proposition 1. Observe that if M -» N, then ML -» NL and 

a 

LAI -» LN. We go by induction on the structure of M. Clearly, M cannot be a variable nor an 
abstraction so M = QR. We can distinguish five cases: 

• If Q = Xx.T and R is a value, then N = L = T{x/R}, because R is a variable or an abstraction. 

(n) (to) 

• If N = TR and L = UR, where Q -» T and Q -» U, then we can apply the induction 

(to) (n) (to) 

hypothesis, obtaining that T -» W and U -» W. This, in turn, implies N -» WR and 

(n) 

L -» WR. 

(n) (to) 

• Similarly, if N = QT and L = QU, where R -» T and R -» U, then we can apply the 
induction hypothesis. 

• If N = QT and L = UR, where R ^ T and Q ^ U, then N ^ UT and L ^ UT. 

• Similarly, if N = UR and L = QT, where R ^ T and Q ^ U, then N ^ UT and L ^ UT. 
This concludes the proof. □ 

a 

Lemma 3 (Parametrical Diamond Property Revisited) // M -» N and M -» L, then 
there is a term P such that N -» P and L -» P where \ \ori\\ = \ \PS\\. 

Proof. We proceed by induction on a/3. If a — f3 = e, then P will be N = L = M. If a/3 ^ but 
either a = e or (3 = e , the thesis easily follows. So, we can assume both a ^ e and (3 ^ e. Let 

(n) p (to) S 

now Q and R such that M -» Q -» N and M -» R ^> L. From Proposition 3, we can distinguish 
two cases: 

7 

• Q = R (and m = n). By induction hypothesis, we know there is T such that N -» T and 
L 4» T, where ||p 7 || = \ \aS\\, which yields ||a 7 || = \\(35\\. 

(to) (ri) 

• There is T with Q -» T and R -» T. By induction hypothesis, there are two terms U, W 
such that T 4 U, T X W, N -1 U, L ^ W, where ||pd|| = ||(m)^|| and \\<jh\\ = \ \(n)n\\. By 
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induction hypothesis, there is P such that U -» P and W -» P, where — \\v T \\- But, 

summing up, this implies 

9 v 

N -» P 

TIT 

L 4 P 

\\a6v\\ = \\{n)p6v\\ = \\{n){m)&\\ = 

= IIMWCHI = IIM(n)»?r|| = \\(m)afiT\\ = 

= \WA\ 

This concludes the proof. □ 

Proposition 4 For every term M, there are at most one normal form N and one integer n such 
that M 4 N and \ \a\ \ = n. 

a fi 

Proof. Suppose M -» N and M -» L, with N and L normal forms. Then, by Lemma 2, there 

7 S 

are P, 7, 5 such that N -» P and L -» P and | |cir'y| | = | \0S\\. But since N and L are normal forms, 
P = N, P = L and 7 = 6 = e, which yields N = L and ||a|| = \\/3\\. □ 

We are now ready to define the abstract time measure which is the core of the paper. 

Definition 3 (Difference cost model) If M -» N , where N is a normal form, then Time(M) 
is \ \a\ \ + \M\. If M diverges, then Time(M) is infinite. 

Observe that this is a good definition, in view of Proposition 4. In other words, showing M -» N 
suffices to prove Time(M) — \ \a\ \ + \M\. This will be particularly useful in the following section. 

As an example, consider again the term n 2 we discussed in the introduction. It reduces to 
normal form in one step, because we do not reduce under the abstraction. To force reduction, 
consider E = n 2 c, where c is a free variable; E reduces to 

A?M-(Ay„-i . . . (Ay 2 -(Ayi.c 2 2/i) 2 y 2 ) 2 • • -)y n 

in 9(n) beta steps. However, Time(E) = 9(2"), since at any step the size of the term is dupli- 
cated. 



4 Simulating Turing Machines 

In this and the following section we will show that the difference cost model satisfies the polynomial 
invariance thesis. The present section shows how to encode Turing machines into the lambda 
calculus. 

We denote by H the term MM, where 

M = Xx.Xf.f(Xz.xxfz). 
H is a call-by-value fixed-point operator: for every N, there is a such that 

HN 4 N(Xz.HNz) 
Ml = 0(\N\) 

The lambda term H provides the necessary computational expressive power to encode the whole 
class of computable functions. 

The simplest objects we need to encode in the lambda-calculus are finite sets. Elements of any 
finite set A = {a\, . . . , a n } can be encoded as follows: 

r aP A = Xx\ Xx n .Xi 
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Notice that the above encoding induces a total order on A such that ai < aj iff i < j. 

Other useful objects are finite strings over an arbitrary alphabet, which will be encoded using 
a scheme attributed to Scott [13]. Let E = {a\, . . . , a n } be a finite alphabet. A string in s G E* 
can be represented by a value r s nS as follows, by induction on s: 

r e nS = Azi Xx n .Xy.y 

r a i u~ lY ' = \x\ Xx n Xy.x i r u 1,T ' 

Observe that representations of symbols in S and strings in S* depend on the cardinality of E. In 
other words, if u G E* and E C A, r u nS ^ r it nA . Besides data, we want to be able to encode 
functions between them. In particular, the way we have defined numerals lets us concatenate two 
strings in linear time in the underlying lambda calculus. 

Lemma 4 Given a finite alphabet X, there are terms AC{Ti), AS(T) and AR(YA such that for 
every a G E and u, v G E* there are a, [3, 7 such that 

ylC*(E) r a nSr u nS * 
.45(E) r u nSV w nE * 
^i?(E) r ?i nS * r w nS * 

and 

INI = o(i) 
WW = 0(H) 
IMI = 0(H) 

Proof. The three terms are defined as follows: 

AC(T.) = \x.\y.xM 1 ...M\ n y 

\/i.Mi = Xy.Xxi Xx^.Xw.Xiy 

AS{T.) = H(Xx.Xy.Xz.yN 1 ...N l z\(Xw.w)z) 

\/i.Ni = Xw.Xk.(Xh.Xxi Xx^\-Xg.Xih)(xwk) 

AR(£) = H(Xx.Xy.Xz.yPi . . . Pj E | (Xw.w)z) 

Mi.Pi = Xw.Xk.xw(Xxi Xx\x\ .Xh.Xik) 

Observe that 

4C(E) r a i ~ ,Er u nE * r a i ^M 1 ...M lsl r u~ lS ' 
^> M l r u~ ,s * 4> r ai u^" 
where a does not depend on u. Now, let Ri be Ni{Xz.AS(E)z/x}. Then, we can proceed by 
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induction: 



AS(E) r £^' r v^' 


a 


(Xy.Xz.yR 1 . . . R m (Xw.w)z) r e n ^* r w nE * 




(i,i) 


r e nS *i?i . . .i?| S |(Aw.w) r w nS * 








AS(X) r a t u^" r v^" 


a 


(Xy.Xz.yRi . . . R m (Xw.w)z) r a lU ^* r v^* 




(i,i) 
^> 


r a l u^*R 1 . . . R ln (Xw.w) r v^' 




7 

^> 


Rfu^v^' 




(1,1,1) 
— » 


(Xh.Xx! Xx m .Xg.Xih)(AS(Y,) r u^ r v^ 




S 


(Xh.Xxi Xx\Y l \-Xg.x i K) r uv^ ¥ ' 




(1) 

—» 


Xx\ Ax| S | .Xg.xfuv^ 



where a, (3, 7 do no depend on u and v. Finally, let Qi be Pi{Xz.AR(T,)z/x}. Then, we can 
proceed by induction: 

AR(Y;) r e^' r v^' Z> (Xy.Xz.yQ 1 ...Q m (Xw.w)z) r e^' r v^' 



(1,1) 



r s^Qi...Q m (Xw.w) r v^' 



13 /x nE* (1) r nE* 



(1,1) 
7 

^> 

(1,1,1) 
^> 

S 



AR(E) r a lU ^" r v^" -» (Xy.Xz.yQ 1 ...Q m {Xw.w)z) r a i u^" r v^ 

r a l u^*Q 1 ...Q m {Xw.w) r v^' 

AR(Z) r u^ r ai v^* 

r u r a lV ^' = r (a iU ) r v^" 

where a, (3, 7 do not depend on u, v. □ 

The encoding of a string depends on the underlying alphabet. As a consequence, we also need to 
be able to convert representations for strings in one alphabet to corresponding representations in 
a bigger alphabet. This can be done efficiently in the lambda-calculus. 

Lemma 5 Given two finite alphabets £ and A, there are terms CC(S,A) and CS(T,,A) such 
that for every do, a\, . . . , a n £ £ there are a and (3 with 

CC(S,A) r a nE r u nA * 
CS'(S,A) r a 1 ...a n nS * 4> r Ul ...u n ^ A " 



Vi.Ui 

and 



e otherwise 



INI - o(i) 

||/3|| = 0(n) 
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Proof. The two terms are defined as follows: 
CC(£,A) 

CS(E, A) 
Vi.TV, 



Xx.xMi . . . M| S | 

r a, nA * if a, e A 
r e nA otherwise 



H{Xx.Xy.yN 1 ...N^e^*)) 



Az.(Aw.Axi Ax^i .Xh.Xiw)(xz) if a, G A 



Az.xz 



otherwise 



Observe that 



CC(E,A) r a 4 nS 



(i) 



r a 4 nS Mi 
\ r £ nA * 



•M| S | 

if a, G A 
otherwise 



Let P l be Ni{\z.CS(Z,A)z/x}. Then: 



C5(S, A) r e nE * 


a 
— » 


(Xy.yP, . . . 


P^fe^ A 're^' 




(i) 

^> 


r £ nS >l.. 






— » 


^ A * 




CS(E, A) r a lU ^ 


7 

— » 


(Ay.yPi . . . 


P| E | r e- ,A *) r a i «- |E * 




(1) 
— » 


r a l u^*P 1 






— » 


Pfu^' 






(1.1) 


f (Aiu.Asi Aa;| A |.A^.a; 4 u;)(C , 1 S'(E,A) r M nE * 
\ CS^E, A) r u nE * 



if G A 
otherwise 



where a, (3, 7, 5 do not depend on u. 



□ 



A deterministic Turing machine M is a tuple (E, a^anfc, Q, qinitial, qfinai, 5) consisting of: 

• A finite alphabet E = {ai, . . . , a„}; 

• A distinguished symbol atiank G E, called the Wanfc symbol; 

• A finite set <5 = {91, • ■ • , <? m } of states; 

• A distinguished state qinitial £ Q, called the initial state; 

• A distinguished state <?/j na ; G Q, called the /ma/ siate; 

• A partial transition function 8 : QxE^QxEx {<— , — J.} such that 8{qi 1 aj) is defined iff 

A configuration for M is a quadruple in E* x E x E* x Q. For example, if 5{qi 1 aj) — {qi,a k , <— ), 
then evolves from (ua p ,aj,v,qi) to (u, a p , a k v, qi) (and from (s,aj,v,qi) to (e,a bla nk,a k v,qi)). 
A configuration like (u,ai,v,qfi na i) is /ma/ and cannot evolve. Given a string w G E*, the initial 
configuration for u is (e, a,u, q lmtla i) if u = aw and (e, aj;^, e, q in itial) if u = £. The string 
corresponding to the final configuration (it, aj, w, qfi na i) is uajt>. 

A Turing machine {^,au an k,Q, Qinitial ,Q final, 8) computes the function / : A* — > A* (where 
A C E) in time g : N —> N iff for every u G A*, the initial configuration for u evolves to a final 
configuration for f(u) in g(|w|) steps. 

A configuration (s,a,t,q) of a machine M = i^,au an k,Q, Qinitial, Q final, 8) is represented by 
the term 

r {u,a,v, q y M = Xx.x r u r ^' r a nE r v nE * r q nQ 

We now encode a Turing machine M — (^,a i a nk,Q, qinitial, q final, $) in the lambda-calculus. 
Suppose E = {ai, . . . , a|s|} and Q = {gi, . . . , <7|Q|} We proceed by building up three lambda 
terms: 
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• First of all, we need to be able to build the initial configuration for u from u itself. This can 
be done in linear time. 

• Then, we need to extract a string from a final configuration for the string. This can be done 
in linear time, too. 

• Most importantly, we need to be able to simulate the transition function of M, i.e. compute 
a final configuration from an initial configuration (if it exists). This can be done with cost 
proportional to the number of steps M takes on the input. 

The following three lemmas formalize the above intuitive argument: 

Lemma 6 Given a Turing machine M. — (E,abiank,Q,qinitiai,qfinai,S) and an alphabet ACE 
there is a term I(A4,A) such that for every u 6 A* ; 7(X,A) r M nA * -» r C nM where C is the 
initial configuration for u and \ \a\ \ = 0(\u\). 

Proof. I(M, A) is defined as 

H{\x.\y.yM 1 ...M m N) 

where 

Mi = Xz.(xz)(Xu.Xa.Xv.Xq.Xw.(^x.xu r a i nJ: wq)(AC('S)av)) 
Let P be M l {Xz.I{M, A)z/x}. Then 



7(.M,A) r £ nA * 


Ct 
— » 


(Xy.yP 1 ...P lAl Nre^* 




(i) 

^> 


r ^ A *P!...P| A |iV 




13 
—» 


N = r {^i a blanki £•> Qinitial^ 


I(M,A) r a lU ^ A ' 


a 

—» 


(Aj/.yP 1 ...P| A |7V) r a^ A * 




(1) 

^> 


r a i u^*P 1 ...I] A \N 







P r w nA * 




(i) 

^> 


(I(M, A) r u~ lA ')(Xu.Xa.Xv.Xq.Xw.(Xx.xu r a,i~ li: wq)(AC(T,)av)) 




7 
^> 


r D^ M {Xu.Xa.Xv.Xq^w.{Xx.xu r a l ^wq){AC{Y 1 )av)) 



where a, (3 do not depend on u and D is and initial configuration for u. Clearly 

r D nM (Xu.Xa.Xv.Xq^w-(^x.xu r a t ^wq)(AC(Y,)av)) r E nM 

where E is an initial configuration for aiU. □ 

Lemma 7 Given a Turing machine M — (E, abiankiQ^QinitiaUQ 'final and for every alphabet A, 
there is a term F(A4, A) such that for every final configuration C for U\ . . .u n there is a such that 

F(M,A) r C nM ^» r v 1 ...v n ^ A \ \\a\\ = 0(n) and 

Vi.v = { Ul lfUl E A 
1 \ e otherwise 

Proof. F(M,A) is defined as 

Xx.x(Xu.Xa.Xv.Xq-AR(Z){CS(Y,, A)u)(AS(E)(CC(E, A)a)(C5(E, A)v)) 
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Consider an arbitrary final configuration r (u, a, v, qp, n aip M ■ Then: 
F{M,Ay(u,a,v,q fina iy M 

AR(Z)(CS(Z, A) r u^*)(AS(Z)(CC(E, A) r a^)(CS(Z, A) r ^ s *)) 
4> AR(Z) r u^ A " r av^ A ' 



^ r r -iA* 

-» u av 1 



where a = 0(|u| + |u|), /3 does not depend on u, v and 7 = 0(|u|). □ 

Lemma 8 Given a Turing machine A4 = CE,ai,iank>QiQinitiaU<lfinah$), there is a term T(A4) 
such that for every configuration C: 

• If D is a final configuration reachable from C in n steps, then there exists a such that 
T(M) r C nM 4> r D iM. moreover || a || = o(n); 

• The term T(A4) r C~ lM diverges if there is no final configuration reachable from C . 

Proof. T(M) is defined as 

H(Xx.Xy.y(\u.Xa.Xv.\q.q(M 1 . . . M\q\)uclv)) 

where 

Vi.Afj = Xu.Xa.Xv.a{Nl ...JVj E| )«« 

Aw.Aw.Ax.xu r a : , " |S w r (7i~ lQ if % = q^„ a / 

Au.Av.x(Az.zu r a fc ~ ,s u r q ; ~ ,Q ) if 6(qi,aj) = (qi,a k , j) 



Vi,j.JV,f EE 



Aw.A W .x(uP 1 ...R S |P(AC(E) r a^ s V ) r ( ? r Q ) if Sfaaj) = 



-| E |J-^o^Zj;- Ufe vyqi^) no^qi,Uj) 
Xu.Xv.xivRx . . . R lJ:l R{AC(Y,) r a k ^u) r qi n Q) if 5(qi,aj) = (qi,a k ,- 

Vi.Pj = AM.Aw.A<7.Ax.xw r ai nS ug 

P ee Au.A<7.Ax.x r e nS r a b iank~' T} vq 

Mi.Ri ee Au.Aii.A<7.Ax.xw r ai nS ug 

P ee Au.A<7.Ax.xu r ab; an fe~ |Sr e~ |S g 

To prove the thesis, it suffices to show that 

T(M) r C nM 4» T(M) r E nM 

where E is the next configuration reachable from C and (3 is bounded by a constant independent 
of C. We need a number of abbreviations: 

\/i.Q l ee M 4 {Az.T(X)z/x} \Ji.U l ee P{Az.T(X)z/x} Vi.Wi ee R l {Xz.T{M)z / x} 
Vi,j.T? = Nf{Xz.T{M)z/x} U = P{Xz.T{M)z / x} W = R{Xz.T(M)z/x} 

Suppose C = (u, dj,v, qi). Then 

T(M) r C^ M ^ r q nQ Ql ^ QlQl r u ^r a ^r v ^ 

4» Q i r u~ |E * r aj~ ,Er i;~ ,E * 

4> T/ r w nEV v nS * 

where 7, <5, p do not depend on C. Now, consider the following four cases, depending on the value 
of S(q l ,a j ): 
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If 8{qi 1 aj) is undefined, then % = qp, n ai and, by definition T- = Xu.Xv.Xx.xu r aj^v r qP® . As 
a consequence, 

T/ r ti nS * r « nS * Ax.a; r u nS * r a J nSr w nEV % nQ 
= r (u,a j ,v,q i )' lM 

If 5(qi,dj) = (qi,dk, |), then T/ = Xu.Xv.(Xz.T(M)z)(Xz.zu r ak~ l ^v r qi~"^). As a consequence, 
nir„nr r „nE* ^ (Az.T(A4)z)(Az.z r w nSV a fc nSr v nE * r ^ nC? ) 



(1) 

= T(A4) r S nA1 

US(qi,aj) = (qi,dk,<—), then 
As a consequence, 



- T(M)(Xz.z r u^ r a k ^ r v^ r qP Q ) 



Xu.Xv.xiuU! . . . C/| E |C/(.4C , (S) r a J nS w) r gi nQ ). 



T/V 2 * 1 "^* (Az.T(A4)z)( r u nS *[/! . . . ^| E |tf (4<7(£) r a 7 -~ ,Er ?;~ ,E *) r ©~ ,Q ) 

Now, if u is e, then 

(Xz.T(M)z)( r u^'U 1 . . . f7| E |J7( J 4C(S) r aj" ,Er t;" ,E *) r ®" ,Q ) 
4» (Az.T(A4)z)[/(.lC*(S) r a J - nEr i; nS * ) r g ; nQ 
-i (Az.T(A4)z)[/( r a J -w nS *) r g/ nQ ) 



(1,1) M 

-» (Az.r(A4)z) r (£,a 6ianfc ,a fe w,^)" | - /vl 

(l) M 

-» T(A4) r (e,a 6ia „ fe ,a fc u,ft) 



where 77, £ do not depend on C. If u is ta p , then 

(Xz.T(M)z){ r u r ^'U 1 ...t/| S |t/(AC(S) r a j " |Sr i;" |S *) r gj" lQ ) 
4> (Az.T(X)z)[/ p r f^ E * (^(£) r a^ Er ^ E * ) r ^ Q 



4> (Az.T(M)z)C/ p r f^ E * ( r a fe i^ E * ) r ^ Q ) 



(1,1) 

^> 

(1,1) 



-» (Az.T(M)z) r (t,a J ,a feW , g; ) n ' M 



T(M) r (t,a v a k v,qi) 



where n, 9 do not depend on C. 
• The case d(qi,a.j) = (qi,ah,— can be treated similarly. 
This concludes the proof. □ 

At this point, we can give the main simulation result: 

Theorem 1 If f : A* — > A* is computed by a Turing machine M in time g, then there is a 
term U(M,A) such that for every u £ A* there is a with U(M 1 A) r M nA -» r f(u)~ lA and 
\\a\\=0(g(\u\)) 

Proof. Simply define U(M, A) = Xx.F(M, A)(T(M)(I(M, A)x)). □ 

Noticeably, the just described simulation induces a linear overhead: every step of M. corresponds 
to a constant cost in the simulation, the constant cost not depending on the input but only on A4 
itself. 
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5 Evaluating with Turing Machines 



We informally describe a Turing machine 1Z computing the normal form of a given input term, if 
it exists, and diverging otherwise. If M is the input term, 1Z takes time 0((Time{M)) A ). 

First of all, let us observe that the usual notation for terms does not take into account the 
complexity of handling variables, and substitutions. We introduce a notation in the style of 
deBruijn [4], with binary strings representing occurrences of variables. In this way, terms can be 
denoted by finite strings in a finite alphabet. 

Definition 4 • The alphabet 6 is {A, @,0, !.,►}. 

• To each lambda term M we can associate a string M# e 6 + in the standard deBruijn way, 
writing @ for (prefix) application. For example, if M = (Xx.xy)(Xx.Xy.Xz.x), then M# is 

@A@*-0*AAA*10 

In other words, free occurrences of variables are translated into while bounded occurrences 
of variables are translated into ► s, where s is the binary representation of the deBruijn index 
for that occurrence. 

• The true length ||M|| of a term M is the length of M* . 

Observe that ||M|| grows more than linearly on \M\: 

Lemma 9 For every term M, \\M\\ — 0(\M\ log \M\). There is a sequence {M n } neN such that 
\M n \ = Q(n), while \\M n \\ = e(\M n \log\M n \). 

Proof. Consider the following statement: for every M, the string M# contains at most 2\M\ — 1 
characters from {A, @} and at most \M\ blocks of characters from {0, !.,►}, the length of each of 
them being at most 1 + [log 2 \M\~\. We proceed by induction on M: 

• If M is a variable x, then M# is The thesis is satisfied, because \M\ = 1. 

• If M is Xx.N, then M# is Am, where u is obtained from iV # by replacing some blocks in the 
form ►withes, where \s\ is at most |~log 2 \M\~\. As a consequence, the thesis remains satisfied. 

• If M is NL, then M* is @N*L* and the thesis remains satisfied. 
This proves ||M|| = 0(\M\ log \M\). For the second part, define 

^ n times ^ n + i times 
M n = Ax. Ay Xy . xT^~~x 

Clearly, 

n + 1 times n times 
M* = X...X ©►u. 

where u is the binary coding of n (so \u\ = 6(logn)). As a consequence: 

|M| = 3n + 3 = 6(n); 
||M|| = \M*\ = 3n + 3 + n\u\ = G(nlogn). 

This concludes the proof. □ 

1Z has nine tapes, expects its input to be in the first tape and writes the output on the same 
tape. The tapes will be referred to as Current (the first one), Preredex, Functional, Argument, 
Postredex, Reduct, StackTerm, StackRedex, Counter. 1Z operates by iteratively performing the 
following four steps: 

1. First of all, 1Z looks for redexes in the term stored in Current (call it M), by scanning 
it. The functional part of the redex will be put in Functional while its argument is copied 
into Argument. Everything appearing before (respectively, after) the redex is copied into 
Preredex (respectively, in Postredex) . If there is no redex in M, then 1Z halts. For example, 
consider the term (Xx.Xy.xyy)(Xz.z){Xw.w) which becomes @@AA@@ ►!►()► OA ► OA ►() 
in deBruijn notation. Table 1 summarizes the status of some tapes after this initial step. 
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Tabic 1: The status of some tapes after step 1 
Preredex @@ 
Functional AA@@ ► 1 ► ► 



Argument A ► 

Postredex A ► 



Table 2: How stack evolves while processing @A^0A^0 



QA^A^O 
OA^OA^O 

@>0A^0 
@A*0A*0 
@A^0A^0 

@A^0>0 




£ 



2. Then, 1Z copies the content of Functional into Reduct, erasing the first occurrence of A and 
replacing every occurrence of the bounded variable by the content of Argument. In the 
example, Reduct becomes A@@A^0^0^0. 

3. 1Z replaces the content of Current with the concatenation of Preredex, Reduct and Postredex 
in this particular order. In the example, Current becomes @A@@A ► ► ► OA ► 0, which 
correctly correspond to (Xy.(Xz.z)yy)(Xw.w). 

4. Finally, the content of every tape except Current is erased. 

Every time the sequence of steps from 1 to 4 is performed, the term M in Current is replaced by 
another term which is obtained from M by performing a normalization step. So, 1Z halts on M if 
and only if M is normalizing and the output will be the normal form of M . 

Tapes StackTerm and StackRedex are managed in the same way. They help keeping track of 
the structure of a term as it is scanned. The two tapes can only contain symbols A\, F@ and S@. 
In particular: 

• The symbol A\ stands for the argument of an abstraction; 

• the symbol F@ stands for the first argument of an application; 

• the symbol S@ stands for the second argument of an application; 

StackTerm and StackRedex can only be modified by the usual stack operations, i.e. by pushing 
and popping symbols from the top of the stack. Anytime a new symbol is scanned, the underlying 
stack can possibly be modified: 

• If @ is read, then F@ must be pushed on the top of the stack. 

• If A is read, then A\ must be pushed on the top of the stack. 

• If ► is read, then symbols S@ and A\ must be popped from the stack, until we find an occurrence 
of F@ (which must be popped and replaced by S@) or the stack is empty. 

For example, when scanning the term @A^0A^0, the underlying stack evolves as in table 2 (the 
symbol currently being read is underlined). Now, consider an arbitrary iteration step, where M 
is reduced to N. We claim that the steps 1 to 4 can all be performed in 0((||M|| + ||./V||) 2 ). The 
following is an informal argument. 

• Step 1 can be performed with the help of auxiliary tapes StackTerm and StackRedex. Current 
is scanned with the help of StackTerm. As soon as 1Z encounter a A symbol in Current, it 
treats the subterm in a different way, copying it into Functional with the help of StackRedex . 
When the subterm has been completely processed (i.e. when StackRedex is becomes empty), 
the machine can verify whether or not it is the functional part of a redex. It suffices to check 
the topmost symbol of StackTerm and the next symbol in Current. We are in presence of a 
redex only if the topmost symbol of StackTerm is F@ and the next symbol in Current is either 
A or Then, 7Z proceeds as follows: 
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• If we are in presence of a redex, then the subterm corresponding to the argument is copied 
into Argument, with the help of StackRedex; 

• Otherwise, the content of Functional is moved to Preredex and Functional is completely 
erased. 

• Step 2 can be performed with the help of StackRedex and Counter. Initially, TZ simply writes 
into Counter, which keeps track of A-nesting depth of the current symbol (in binary notation) 
while scanning Functional. StackRedex is used in the usual way. Whenever we push A\ 
into StackRedex, Counter is incremented by 1, while it is decremented by 1 whenever A \ is 
popped from StackRedex. While scanning Functional, TZ copies everything into Reduct. If TZ 
encounters a it compares the binary string following it with the actual content of Counter. 
Then it proceeds as follows: 

• If they are equal, TZ copies to Reduct the entire content of Argument. 

• Otherwise, TZ copies to Reduct the representation of the variable occurrences, without 
altering it. 

Lemma 10 If M N, then n < Time(M) and \N\ < Time(M). 

Proof. Clear from the definition of Time(M). □ 
Theorem 2 TZ computes the normal form of the term M in 0((Time(M)) 4 ) steps. 

6 Closed Values as a Partial Combinatory Algebra 

If U and V are closed values and UV has a normal form W (which must be a closed value), then we 
will denote W by {U}(V). In this way, we can give 5 the status of a partial applicative structure, 
which turns out to be a partial combinatory algebra. The abstract time measure induces a finer 
structure on S, which we are going to illustrate in this section. In particular, we will be able to 
show the existence of certain elements of S having both usual combinatorial properties as well as 
bounded behaviour. These properties are exploited in [3], where elements of 5 serves as (bounded) 
realizers in a semantic framework. 

In the following, Time({U}(V)) is simply Time{UV) (if it exists). Moreover, (V, U) will denote 
the term Xx.xVU. 

First of all, we observe the identity and basic operations on couples take constant time. For 
example, there is a term M swap such that {M swap }((V, U)) = (U, V) and Time({M swap }((V, U) j) = 
5. Formally: 

Propositions (Basic Operators) There are terms M^, M swap , M ass i, M tens £ S and con- 
stants Cid, c swap , c ass i, c\ ens and cf ens such that, for every V,U,W £ S 7 there is Y £ S such 
that 

{M ld }(V) = 
{M swap }((V,U)) = 
{M assl }((V,(U,W)}) = 
{M tens }(V) = 
{Y}((U,W)) = 
Time({M ld }(V)) < 
Time{{M swap }((V,U))) < 
Time({M assl }{{V,{U,W)))) < 
Time({M tens }(V)) < 
Time{{Y}{(U,W))) < 



V 

(U,V) 

((V,U),W) 

Y 

({V}(U),W) 

Cid 
Cswap 
C-assl 
Wens 

c 2 tens + nme({V}(U)) 
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Proof. First of all, let us define terms: 

Mid = Xx.x 
Mswap = Xx.x(Xy.Xw.Xz.zwy) 
M ass i = Xx.x(Xy.Xw.w(Xz.Xq.Xr.r(Xs.syz)q)) 
Mtens = Xs.Xx.x(Xy.Xw.(Xx.Xz.zxw)(sy)) 

Now, let us observe that 

(i) 

M ld V 

(V, U){Xy.Xw.Xz.zwy) 
{Xy.Xw.Xz.zwy)VU 
(Xw.Xz.zwV)U 
(Xw.Xz.zwV)U 
(U,V) 

(V, (U, W)){Xy.Xw.w(Xz.Xq.Xr.r(Xs.syz)q)) 
(Xy.Xw.w(Xz.Xq.Xr.r(Xs.syz)q))V (U, W) 
(Xw.w(Xz.Xq^r.r(Xs.sVz)q))(U,W) 
(U, W)(Xz.Xq^r.r(Xs.sVz)q) 
(Xz.Xq.Xr.r(Xs.sVz)q)UW 
Xr.r(Xs.sVU)W) = ((V,U),W) 
Xx.x(Xy.Xw.{Xx.Xz.zxw)(Vy)) = Y 
(U, W)(Xy.Xw.(Xx.Xz.zxw)(Vy)) 
(Xy.Xw.(Xx.Xz.zxw){Vy))UW 
(Xw.(Xx.Xz.zxw)(VU))W 
(Xx.Xz.zxW)(VU) 

□ 

There is a term in S which takes as input a pair of terms (V, U) and computes the composition 
of the functions computed by V and U. The overhead is constant, i.e. do not depend on the 
intermediate result. 

Proposition 6 (Composition) There are a term M conc G S and two constants c concl c 2 conc such 
that, for every V, U,W, Z £ E, there is X £S such that: 

{M conc }((V,U)) = X 

{X}{W) = {V}({U}(W)) 

Time({M conc }((V,U))) < c\ onc 

Time({X}(W)) < c 2 conc + Time({U}(W)) + Time({V}({U}(W))) 

Proof. First of all, let us define term: 

M conc = Xx.x(Xx.Xy.Xz.x(yz)) 



M swap (V,U) 



M assl (V,(U,W)) 



MtcnsV 

Y(U,W) 



(i) 

^> 

(i) 

^> 

(i) 

^> 

(i) 

^> 

(i) 

^> 

(i) 

^> 

(i) 

^> 

(i) 

^> 

(i) 

^> 

(i) 

— » 

(i) 

^> 

(i) 
— » 

(i) 

^> 

(i) 

^> 

(i) 

^> 

(i) 
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Now, let us observe that 



M conc (V,U) 



XW 



(1) 
(1) 

— » 

(1) 

— » 

(1) 



(V,U)(\x.\y.\z.x(yz)) 
(Xx.Xy.Xz.x(yz))VU 
(Xy.Xz.V(yz))U ^ Xz.V(Uz) 
V(UW) 



X 



□ 



We need to represent functions which go beyond the realm of linear logic. In particular, terms 
can be duplicated, but linear time is needed to do it. 

Proposition 7 (Contraction) There are a term M cont £ 5 and a constant c cont such that, for 
every V € S: 

{M cont }(V) = (V,V) 
Time({M cont }{V)) < c cont + \V\. 

Proof. First of all, let us define term: 



Xx.Xy.yxx 



Now, let us observe that 



where n < \ \V\ 



M cont V ( ^ (V,V), 



□ 



From a complexity viewpoint, what is most interesting is the possibility to perform higher-order 
computation with constant overhead. In particular, the universal function is realized by a term 
M eval such that {M eval }({V,U}) = {V}(U) and Time({M eval }((V, U))) =4 + Time({U}(V)). 



Proposition 8 (Higher-Order) There are terms M eva i, M curry e S and constants c eva i, c\ 



-■curry ; ^ curry 



such that, for every V,U gS, there are W,X,Y,Z S S such that: 
{M eval }((V,U)) = {V}(U) 



-'curry > 



{M curry }(V) 
{W}(X) 
{Y}(Z) 

Time({M eval }((V,U))) 
Time({M curry }{V)) 
Time({W}(X)) 
Time{{Y}{Z)) 



= W 
= Y 

= {V}((X,Z)) 

< c eva i + Time({U}(V)) 

< 
< 
< 



l 

curry 
.2 

curry 
.3 

curry 



+ Time({V}((X, Z))) 



Proof. Define: 



M, 

M c 



eval 



Xx.x(Xy.Xw.yw) 
Xx.Xy.Xw.x(Xz.zyw) 
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Now, observe that 



M eval (V,U) 


(i) 


(V,U)(Xy.Xw.yw) 




(i) 


(Xy.Xw.yw)VU 




(i) 

— » 


(Xw.Vw)U ^ VU 


curry V 


(i) 

^> 


Xy.Xw.V(Xz.zyw) = W 


wx 


(i) 

^> 


Xw.V(Xz.zXw) = W = Y 


YZ 


(i) 

^> 


V(Xz.zXZ) = V(X, Z) 



□ 

The fact that a "universal" combinator with a constant cost can be defined is quite remarkable. It 
is a consequence of the inherent higher-order of the lambda-calculus. Indeed, this property does 
not hold in the context of Turing machines. 

7 Conclusions 

We have introduced and studied the difference cost model for the pure, untyped, call-by-value 
lambda-calculus. The difference cost model satisfies the invariance thesis, at least in its weak 
version [12]. We have given sharp complexity bounds on the simulations establishing the invariance 
and giving evidence that the difference cost model is a parsimonious one. We do not claim this 
model is the definite word on the subject. More work should be done, especially on lambda-calculi 
based on other evaluation models. 

The availability of this cost model allows to reason on the complexity of call-by- value reduction 
by arguing on the structure of lambda-terms, instead of using complicated arguments on the details 
of some implementation mechanism. In this way, we could obtain results for eager functional 
programs without having to resort to, e.g., a SECD machine implementation. 

We have not treated space. Indeed, the very definition of space complexity for lambda- 
calculus — at least in a less crude way than just "the maximum ink used [8]" — is an elusive subject 
which deserves better and deeper study. 
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